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Abstract. A relational structure is a core, if all its endomorphisms are embeddings. This 
notion is important for computational complexity classification of constraint satisfaction 
problems. It is a fundamental fact that every finite structure has a core, i.e., an endomor- 
phism such that the structure induced by its image is a core; moreover, the core is unique 
up to isomorphism. 

We prove that every u> -categorical structure has a core. Moreover, every tj-categorical 
structure is homomorphically equivalent to a model-complete core, which is unique up to 
isomorphism, and which is finite or ^-categorical. We discuss consequences for constraint 
satisfaction with ^-categorical templates. 



The notion of the core of a finite structure is a central concept in structural combi- 
natorics [22]. It can be defined in many equivalent ways, one of which is as follows (for 
detailed definitions see Section [2]). A finite structure S is a core if every endomorphism of 
S is an automorphism. A structure C is called a core of S if C is a core and is isomorphic 
to a substructure induced by the image of an endomorphism of S. 

For finite structures S it is well-known and easy to prove that a core of S always exists, 
and is unique up to isomorphism; see e.g. [22]. We therefore speak of the core of a finite 
relational structure S. For infinite structures various core-like properties were studied by 
Bauslaugh [5,6]. In general, infinite structures might not have a core in the sense introduced 
above, see [5,6]. 

An important application of the concept of a core is the theory of constraint satisfaction 
in computer science. Roughly speaking, a constraint satisfaction problem is a computational 
problem where the input consists of a given set of variables and a set of constraints that 
are imposed on these variables, and where the task is to assign values to the variables such 
that all the constraints are satisfied. Such problems appear in numerous areas of computer 
science, and constraint satisfaction problems attracted considerable attention in artificial 
intelligence. 
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Many constraint satisfaction problems can be formalized as follows, using the concept of 
structure homomorphisms; again, for a detailed introduction of all the involved concepts see 
Section [2 Let T be a finite or infinite structure with relational signature r. The constraint 
satisfaction problem (CSP) for the so-called template T is the following computational 
problem. 

csp(r) 

INSTANCE: A finite structure S of the same relational signature r as the template T. 
QUESTION: Is there a homomorphism h : S — > T? 

We want to stress that T is not part of the input; each T defines a computational 
problem. Note that if two structures T and A are homomorphically equivalent, i.e., there 
is a homomorphism from T to A and from A to T, then these structures have the same 
constraint satisfaction problem. In particular, a structure S and its core C have the same 
constraint satisfaction problem. 

For a finite template T, the computational problem CSP(T) is clearly contained in 
NP. A classification of tractable and NP-hard constraint satisfaction problems with a finite 
template is intensively studied, but still not complete. See [12,17,21], just to mention a few 
highlights on that subject. In all these approaches, the authors make use of the assumption 
that the templates of the constraint satisfaction problems under consideration are cores. 

The class of constraint satisfaction problems with an infinite template was not yet 
studied systematically. It turns out that many interesting computational problems can be 
formulated with templates that are lo- categorical. The concept of w-categoricity is central 
in classical model theory, and will be introduced carefully in Section The following 
list of well-known computational problems can all be formulated with a countably infinite 
template that is w-categorical. 

• Allen's interval algebra, and all its fragments [4,23,27,31] 

• Problems in phylogenetic analysis [20,32] 

• Tree description constraints in computational linguistics [10,11,15] 

• Computational problems in the theory of relation algebras [16,24,30] 

• All CSPs in monotone monadic SNP without inequality [9, 17] 

Moreover, every constraint satisfaction problem with a finite template A can also be 
formulated with an w-categorical template. 

In this article we study how the notion of the core of a finite relational structure can 
be generalized to w-categorical structures. We say that an infinite structure T is a core, if 
every endomorphism e of T is an embedding, i.e., e is an isomorphism between T and the 
structure induced by the image of e in T. This is indeed a generalization of the previous 
definition for finite structures, since every embedding of a finite structure A in A is an 
automorphism. As in the finite case, we say that T has a core A, if A is a core and A is 
isomorphic to the structure induced in T by the image of some endomorphism of T. 

Note that our definition of a core is fundamentally different from the notion of a core 
structure as introduced in [28] in model theory. There, a structure T is called a core structure 
of a first-order theory T if T is isomorphic to exactly one substructure of every model of T. 

Results. We will show that every w-categorical structure T has a core. Moreover, every u- 
categorical T-structure V is homomorphically equivalent to a model-complete core T c , which 
is unique up to isomorphism, and which is finite or again w-categorical. 
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If r is expanded by all primitive positive definable relations, then T is homomorphically 
equivalent to a homogeneous core T c , which is unique up to isomorphism, and which is fi- 
nite or w-categorical. The condition that T is expanded by all primitive positive definable 
relations is natural in the context of constraint satisfaction, since a relational structure and 
its expansion by all primitive positive definable relations have the same computational com- 
plexity (see e.g. [26]). Since T c is in this case homogeneous, it admits quantified elimination 
and has a V3-axiomatization. 

In Section [7] we also prove the following result, which has consequences for the theory of 
constraint satisfaction. If we expand T c by a singleton relation, then the resulting constraint 
satisfaction problem has the same computational complexity as CSP(T C ). This was shown 
for finite templates in [12], and is of fundamental importance for the so-called algebraic 
approach to constraint satisfaction. 

2. Cores 

Let r and A be relational structures with the same relational signature r. We use 
the same symbols for the relation symbols from r and for the respective relations in T, 
and the same symbols to denote a structure and its universe. A mapping / : T — > A is 
called a homomorphism, if for all relation symbols R € r and x\, . . . ,x n € T the relation 
R(f(xi), . . . , f{x n )) holds in A whenever R(x±, . . . ,x n ) holds in T. A homomorphism is 
called strong, if R(xi, . . . , x n ) holds in T if and only if R(f(xi), . . . , f(x n )) holds in A. 
An injective strong homomorphism from T to A is also called an embedding; if there exists 
an embedding of T into A, then A is called an extension of T. A homomorphism from T 
to r is called an endomorphism of T, and a bijective strong endomorphism of T is called 
an automorphism of T. Two structures T and A are called homomorphically equivalent, if 
there is a homomorphism from T to A and a homomorphism from A to T. 

Definition 1. A (finite or infinite) structure T is a core if every endomorphisms of T is an 
embedding. A core T is called a core of A if T is the image of an endomorphism of A. 

Homomorphisms that are not embeddings are called strict. The above definition says 
that cores do not have strict endomorphisms. For finite cores it clearly holds that every 
endomorphism is an automorphism, hence our definition is a generalization of the notion of 
a core for finite structures. 

The following proposition illustrates the relevance of cores for constraint satisfaction 
problems. 

Proposition 2. Let T be a relational structure. Then CSP(T) can be formulated as a 
constraint satisfaction problem with a finite template if and only if T has a finite core. 

Proof. Clearly, if T has a finite core A, then CSP(r) is equivalent to CSP(T). Conversely, 
suppose CSP(r) equals CSP(A) for a finite template A. This implies that every finite 
substructure of T homomorphically maps to A. A standard compactness argument shows 
that there is a homomorphism from T to A, and therefore T has a finite core. □ 
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3. COUNTABLY CATEGORICAL STRUCTURES 

Finite structures are up to isomorphism determined by their first-order theory. We 
can not expect this for infinite structures: by the theorem of Lowenheim-Skolem, every 
consistent theory with a model of cardinality A has models of arbitrary cardinality > A. 
However, it might still be the case that all models of a certain cardinality are isomorphic. 
If this is the case for the countably infinite models, we call the theory u- categorical. A 
countably infinite structure T is called to-categorical, if its first-order theory Th(T) (i.e., 
the set of all first-order sentences that hold in T, where the atomic formulas are built from 
the symbols in r and equality) is w-categorical. Throughout the paper we only consider 
relational and at most countable structures and signatures. Despite the powerful theorems 
quoted below, the class of a;-categorical structures remains somewhat mysterious, and all 
classification results require some additional properties (stability in e.g. [29] , or homogeneity 
in [14]). 

Let G be a permutation group, and let G act on X. For all k > 1 there is a natural 
action of G on the set X k of /c-tuples over X, defined by (x\, . . . , x^jir = (xitt, . . . , x^) for 
permutations tt from G. An orbit of /c-tuples in G (with respect to this action) is a smallest 
subset S of X k such that x € S implies that xn G S for all it € G. All notions used here 
are standard and can be found e.g. in [25]. 

Theorem 3 (Engeler, Ryll-Nardzewski, Svenonius). The following properties of a countably 
infinite structure T are equivalent: 

(1) the structure T is u -categorical; 

(2) for each n > 1, the automorphism group ofT contains finitely many orbits of n-tuples; 

(3) for each n > 1, T admits finitely many inequivalent formulas with n free variables. □ 

Permutation groups with the second property in Theorem[3]are called oligomorphic [13]. 
A famous example of an w-categorical structure is (Q,<), the dense linear order of the 
rational numbers. A famous structure that is not w-categorical is (N, <). This structure 
has an infinite number of orbits of pairs, and hence can not be w-categorical, because 
whenever x\ — X2 ^ y\ — y2, then (x\,X2) can not be in the same orbit as (2/1,2/2)- 

Constraint satisfaction with w-categorical templates is a strict extension of constraint 
satisfaction with finite templates. Let A be a finite template for a constraint satisfaction 
problem. To formulate CSP(A) with an w-categorical template, add for each vertex v in 
A a countably infinite number of copies v\ , V2, ■ ■ ■ , such that for all i > 1 the relation 
R(. . . , Vi, . . . ) holds in the resulting structure T if and only if R(. . . , v, . . . ) holds in A. It 
is not hard to see that the structure T is w-categorical, and that the core of T is isomorphic 
to the core of A. Clearly, there are constraint satisfaction problems with w-categorical 
templates that can not be formulated with finite templates: all the classes of computational 
problems mentioned in the introduction contain examples of such problems. 

To illustrate the concepts we have seen so far, we formulate several well-known compu- 
tational problems as constraint satisfaction problems with w-categorical templates. In all 
these examples, it is fairly easy to check that the chosen template is a core. We also do 
not always prove w-categoricity of these structures, and that they indeed have the specified 
constraint satisfaction problem. But references with further illustration and proofs will be 
given. Three more examples follow at the end of Section [31 and for these examples the ver- 
ification that the structures are indeed cores is more interesting. We do not present these 
examples here, because we need the concept of amalgamation to define them conveniently, 
which will be introduced in Section [U 
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Betweenness. The hardness of many fragments of Allen's Interval Algebra [4, 27] can be 
proven easily by reduction from Betweenness, an important NP-hard problem that can be 
found in Garey and Johnson [19]. Given a finite set V, and a collection C of ordered triples 
(x, y, z) of distinct elements from V , the computational question is whether there is an 
injective function / : V — > {1, . . . , \V\} such that, for each (a,b,c) € V, we have either 
f(a) < f(b) < /(c) or /(c) < f(b) < /(a). The formulation as a constraint satisfaction 
problem is straightforward, using for instance the rational numbers as the base set of the 
template. 

Switching- Acyclicity. Given a digraph D = (V; E), can we partition the vertices V into two 
parts, such that the graph that arises from D by switching all arcs between the two parts 
is acyclic? To formulate this as a constraint satisfaction problem with an w-categorical 
template, consider a dense subset X of Q, and switch the order < between the elements of 
X and Q — X, and leave the edges within X and within Q — X unchanged. The resulting 
structure is called S(2) and is isomorphic for all choices of dense sets X, see e.g. [14]. The 
constraint satisfaction problem of S*(2) is the problem described above [11]. For equivalent 
definitions of S(2) and an hardness-proof of its constraint satisfaction problem, see [7, 11]. 

Partial tree descriptions. Our next example is a computational problem that was studied 
in computational linguistics [15]. A polynomial time algorithm can be found in [10]. Let 
D be a digraph with two types of arcs, called ancestorship and non- ancestor ship arcs. The 
question is whether D is a consistent partial tree description, i.e., whether we can find a 
forest with oriented edges on the vertex set of D, such that for every ancestor arc in D there 
is a directed path in the forest, and for every non- ancestor arc there is no directed path 
in the forest. As shown in [11], we can formulate this problem as a constraint satisfaction 
problem with an ^-categorical template. 

Non-cores. Of course, there are plenty of u;-categorical structures that are not cores, for 
instance the Random graph R [13,25], whose core is the complete graph on a countably 
infinite set of vertices (the constraint satisfaction problem of R and K w is trivial) . 

4. Homogeneous Structures and Amalgamation Classes 

We need another concept, which is of a more combinatorial nature, and links uj- 
categoricity via homogeneity and Frai'sse's theorem to amalgamation classes. A structure is 
homogeneous (sometimes also called ultra-homogeneous [25]) if every isomorphism between 
finite substructures of T can be extended to an automorphism (in this paper, substructure 
always means induced substructure, as in [25]). A structure T admits quantifier elimination, 
if every first-order formula has in T a quantifier-free definition. 

Proposition 4 (see e.g. 2.22 in [13], and [25]). An uj- categorical structure has quantifier 
elimination if and only if it is homogeneous. A countable homogeneous structure T is u- 
categorical ifT contains finitely many relations of arity k, for all k > 1. □ 
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For an example of a homogeneous structure that is not w-categorical, consider the 
expansion of a countably infinite structure T by unary singleton predicates for each element 
in r. This structure is homogeneous, since there are no distinct isomorphic substructures 
in r, and it is not w-categorical, since the number of orbits in the automorphism group of 
r is infinite. 

The next theorem asserts that a countable homogeneous structure is up to isomorphism 
characterized by its age. The age of a relational structure T is the set of finite structures that 
embed into T (this is terminology that goes back to Frai'sse [18]). A class of finite relational 
structures C is an amalgamation class if C is nonempty, closed under isomorphisms and 
taking substructures, and has the amalgamation property, which says that for all A, B±,B2 £ 
C and embeddings e\ : A — > B\ and e2 : A — > B2 there exists C G C and embeddings 
fi : B\ — > C and fi ■ B2 — > C such that f\e\ = f2&2- 

Theorem 5 (Frai'sse [18]). A countable class C of 'finite relational structures with countable 
signature is the age of a countable homogeneous structure if and only if C is an amalgamation 
class. In this case the homogeneous structure is up to isomorphism unique and called the 
Fraisse-limit of C. □ 

The following templates of well-known constraint satisfaction problems are easily de- 
fined with amalgamation classes. 

Triangle-freeness. Given a graph G, is G triangle-free? Clearly, this problem can be solved 
in polynomial time. However, it can not be formulated as a constraint satisfaction problem 
with a finite template. To formulate this problem as a constraint satisfaction problem with 
an w-categorical template, note that the class of all triangle-free graphs is an amalgamation 
class. Let us denote its Fraisse-limit by fl It is well-known and not hard to see [13] that 
this graph is up to isomorphism uniquely determined by the fact that it is triangle-free and 
has the following extension property: for all finite subsets A, B of vertices of -ft such that 
all vertices in A are pairwise not adjacent there exists a vertex z in $ that is not in A, not 
in B, adjacent to all vertices in A, and not adjacent to all vertices in B. Clearly, CSP(^) 
is the computational problem described above. 

We claim that the structure -ft is a core. Suppose otherwise that there is a strict 
endomorphism e. If e(u) = e(v), then u and v can not be connected in <fi. We apply the 
extension property twice to derive that there must be adjacent vertices w and w' in ^ such 
that w' is connected to u and w is connected to v. But then, e(u) = e(v), e(w), e(w') form 
a triangle, a contradiction. Hence, since e is strict, there must be non-adjacent vertices a 
and b such that e(a) is adjacent to e(b). Again by the extension property ^ contains a 
vertex w that is adjacent to both a and b. But then e(o), e(w), e(b) form a triangle, again 
a contradiction. Therefore every endomorphism of <fi is an embedding. 

No-mono-tri. The structure ^], i.e., the structure that consists of two copies of ^, where 
all vertices between the two copies are linked, has an interesting constraint satisfaction 
problem, which can be formulated as follows: Given a graph, can we partition its vertices 
into two parts such that both parts do not contain a triangle? This problem is a rather 
typical example from the class monotone monadic SNP without inequality (MMSNP), a 
fragment of existential second-order logic introduced in [17] in the context of constraint 
satisfaction. A general result on so-called G-free colorability implies its NP-hardness [1]. 
Every constraint satisfaction problem in MMSNP can be formulated with an w-categorical 
template [9]. The construction given in [9] also shows that the above structure is 
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cu-categorical. Similarly as in the previous example for the graph -jd, it is not hard to show- 
that [<fi, <fi\ is a core. 

Quartet compatibility. The next example is an important structure in the theory of infinite 
permutation groups [13]. A boron tree is a finite tree in which all vertices have degree one 
[hydrogen atoms) or degree three {boron atoms). On the hydrogen atoms of a boron tree 
we can define a quaternary relation xy\uv that holds when the paths joining x to y and u 
to v are disjoint. The class of all structures T> with a quaternary relation that stem from 
a boron tree as defined above is an amalgamation class [2]. Let D be the Frai'sse-limit of 
D. Then CSP(D) is a well-known NP-hard problem [32] that was independently studied 
in phylogenetic analysis (without any reference to constraint satisfaction), and is called 
quartet- compatibility: Given a collection C of quartets xy\uv over a set X, is there some 
tree with leaf set X such that for each quadruple xy\uv in C the paths from x to y and 
from u to v do not have common vertices? 

Rooted triple consistency. The next problem is studied in phylogenetic analysis, again with- 
out notice that the problem can be stated as a constraint satisfaction problem. If we fix 
a point a in the previous structure D and consider the ternary relation ':' defined by 
x : yz 44> ax\yz, we again obtain an cu-categorical structure (this is a C-set in [2]). The age 
of this structure now contains the finite structures T that come from finite rooted trees, and 
the relation x : yz says that the least common ancestor of y and z is strictly below the least 
common ancestor of x, y, and z in the tree T. The corresponding constraint satisfaction 
problem is known as the rooted triple consistency problem [32], and tractable. The first 
polynomial time algorithm for this problem goes back to [3], motivated by a question in 
database theory. 

5. Existential Positive Expansions 

In this section we study various syntactic restrictions of first-order logic. Recall that 
if relations are added to a given r-structure T then the resulting structure T' is called an 
expansion of T, and T is called the r-reduct of T'. 

A first-order formula <\> is primitive (primitive positive), if it is of the form 

3x.tpi A ■ ■ • A ipk 

where ipi are literals (atomic formulas) that might include the equality relation. It is called 
existential (existential positive), if it is of the form 3x.^ where \I/ is quantifier- free (and 
negation- free). The strongest of these four syntactic restrictions, primitive positivity, is im- 
portant for constraint satisfaction, since the expansion of a template with primitive positive 
definable (short, pp-definable) relations does not change the complexity of the corresponding 
constraint satisfaction problem. This is an easy observation, see e.g. [26]. 

A mapping of a r-structure T to a r-structure A is called elementary, if it preserves all 
first-order r-formulas (this is a standard notion in model-theory [25]). A theory T is called 
model- complete, if all embeddings between models of T are elementary. In the case that T 
is the theory of a structure V, we also say that V is model-complete, as usual. If T is a first- 
order theory, then V is called an existentially complete model of T if for every existential 
formula <f>, every tuple o in T, and every model A of T such that T is a substructure of A 
and 4>(a) holds in A, the formula 4>(a) also holds in T. 
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Model-completeness can be characterized in many different ways. We say that two 
formulas (f), ip are equivalent with respect to a theory T if in every model of T the formulas 
4> and -0 define the same relations. 

Proposition 6 (Theorem 7.3.1 in [25]). Let T be a first-order theory. Then the following 
are equivalent. 

• T is model- complete 

• Every model ofT is an existentially complete model ofT 

• Every first- order formula is equivalent to an existential formula with respect to T. Q 

An cj-categorical structure is model-complete if and only if its first-order theory is 
equivalent to a set of V3-sentences, i.e., sentences of the form Yx3y(f)(x, y), where <p is 
quantifier-free (see e.g. Theorem 7.3.3 and 7.3.4 in [25]). The following proposition follows 
directly from Theorem 7.2.1 in [25]. 

Proposition 7. Let V be a relational structure, and let T be the set of allVB- sentences that 
hold in r. Then there exists an extension of T that is an existentially complete model of 



The case that all existential formulas are equivalent to existential positive formulas can 
be characterized in a different way. 

Lemma 8. Let T be a first-order theory such that every homomorphism between models of 
T is an embedding. Then every existential formula is equivalent to an existential positive 
formula with respect to T . 

Proof. A formula <j> is equivalent to an existential positive formula with respect to a theory 
T if (f> is preserved by all homomorphism^] between models of T; this fact is stated as 
Exercise 2 in Section 5.5 in [25]. Let / be a homomorphism between two models of T. 
By assumption, / is an embedding, and therefore clearly preserves all existential formulas. 
Hence, all existential formulas are equivalent to an existential positive formula with respect 



The following lemma will be useful to construct homogeneous models. 

Lemma 9. If T is a structure that has been expanded by all primitive definable relations, 
then there is a homogeneous structure with the same age as T. 

Proof. Let a = (ai, . . . , a&) be a tuple of elements from T, let B\,B2 be finite induced 
substructures of T and e\ : a — > B\ and e2 : a — > be embeddings. Since there are 
relation symbols for every primitive formula in the signature, there is a relation R\ that 
holds on the tuple ei(a) in T, and is defined by the following primitive formula <p. Let 
b±, . . . , b m be the elements of B\. The formula <p has k free variables xi, . . . ,Xk and has the 
form <f> := 3xk+i, ■ ■ ■ ,x m ip. The formula tp is a conjunction of literals defined as follows. 
If there is an l-&ry relation symbol R in the signature of F such that R holds on elements 
bi 1} . . . ,bi t in B, 1 < i\, . . . , i\ < m, then -0 contains the conjunct R(xi 1 , ... ,2^). If the 
relation R does not hold on 6^, ... , bi t in B, then ip contains the conjunct -'R(xi 1 , . . . , x^). 
Moreover, ifi contains conjuncts of the form Xi ^ Xj for all distinct indices i, j from 1, . . . , m. 

^We say that a formula <j> is preserved by all homomorphisms between models of T if whenever ft is a 
homomorphism from Mi to M2 that maps a tuple a of elements from Mi pointwise to a tuple b of elements 
from M2, where Mi and M2 are models of T, and a satisfies <fi in Mi, then b satisfies <j> in M2. 



T. 



□ 



to T, 



□ 



CORES OF COUNTABLY CATEGORICAL STRUCTURES 



9 



We also have a relation R2 corresponding in an analogous way to B2 where the points 
from B2 — 62(a) are existentially quantified, and which holds on 62(a) in V. Since e\ and 
e2 are embeddings, these relations also hold on a in T. They assert that we can find an 
extension C of the structure induced by a and embeddings f\ : B\ — > C, f2 ■ B2 — > C 
such that /id = /2e2- Thus, the age of T has the amalgamation property, and Theorem [5] 
implies that there is a homogeneous structure with the same age. □ 

We can combine Lemma [8] and Lemma [9] and obtain the following. 

Corollary 10. Let T be a structure that has been expanded by all existential positive defin- 
able relations. If all homomorphisms between models of Th(T) are embeddings, then there 
is a homogeneous structure with the same age as T. 

Proof. Proposition [8] shows that every existential, and in particular every primitive formula 
is in r equivalent to an existential positive formula. Hence, T is also expanded by all 
primitive definable relations, and Proposition [9] shows that there is a homogeneous structure 
that has the same age as T. □ 



6. Cores of Countably Categorical Structures 

In this section we state and prove the main results of the paper. We start with a 
proposition on the existence of a 'youngest' endomorphic image of an u;-categorical structure. 
The proof employs a typical technique for u;-categorical structures. 

Proposition 11. Let T be an u- categorical relational r-structure. Then there exists an 
endomorphism cofT such that for every other endomorphism g, all finite substructures of 
c(r) embed into g(T). This is, there exists an endomorphic image ofT of smallest age. 

Proof. Let S be the set of all finite r-structures S such that there is an endomorphism g of 
r so that S does not embed into g(T). We have to show that there is an endomorphism c 
such that no structure from S embeds into c(T). For the construction of c we consider the 
following tree. Let 01,02, ■■ ■ be an enumeration of T. The vertices on level n of the tree 
are labeled with equivalence classes of good homomorphisms from the structure induced 
by {ai, . . . ,a n } to T. A homomorphism h is good, if no structure from S embeds into the 
structure induced by h({a±, . . . , a n }) in T. Two homomorphisms g± and 52 are equivalent, 
if there exists an automorphism a of T such that g\ = g2a. Clearly, if a homomorphism 
is good, then all equivalent homomorphisms and all restrictions are also good. A vertex 
u on level n + 1 in the tree is connected to a vertex v on level re, if some homomorphism 
from u is the restriction of some homomorphism from v. Because of w-categoricity, the 
tree is finitely branching. We want to show that the tree has vertices on each level re, 
and iteratively construct a sequence hi,kz, • • • ,hk of homomorphisms from {ai, . . . ,a n } 
to r, where the last endomorphism hk induces a good homomorphism. Initially, if no 
structure from S imbeds into the structure induced by {ai, . . . ,a n }, we can choose the 
identity as a good homomorphism. Otherwise, there is a structure S G S that embeds into 
the structure induced on {ai,...,a n }, and an endomorphism e such that e(r) does not 
contain S. The mapping h\ restricted to {ai, . . . ,a n } is a strict homomorphism, because 
if it was an embedding, S embeds into the structure induced by the image of {ai, . . . , a n } 
under this mapping, which is by assumption impossible. 
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In step i, if no structure in 8 embeds into the structure induced by hi({ai , . . . , a n }), then 
hi is a good homomorphism, and we are again done. Otherwise there is an endomorphism e 
of T and a structure S € S that embeds into the structure induced by hi({a±, . . . , a n }), such 
that S does not embed into e(T). We can then define a homomorphism hi + \ : {a%, . . . , a n } — ► 
T by hi + \(x) := e(hi(x)), which is again a strict homomorphism. Since in the sequence of 
structures induced by h\{{a\, . . . , a n }), h2({a%, . . . , a n }), . . . either the number of vertices 
decreases or the number of tuples in relations increases, and since T is w-categorical, the 
sequence has to be finite. Hence, there exists a good homomorphism from {a±, . . . , a n } to 
r, for all n > 0. By Konig's tree lemma, there exists an infinite path in the tree. Since 
adjacency in the tree was defined by restriction between homomorphisms, this path defines 
an endomorphism c of T. By construction, no structure in S embeds into c(T). □ 

In the following, c(T) denotes the structure induced by the image of the endomorphism 
c in r that was constructed in Proposition PTT1 Note that Proposition [11] says that all cores 
of r have the same age as c(T). We will use the following well-known lemma several times 
(see e.g. Section 2.6 in [13]). It can be shown by a similar application of Konig's lemma as 
in the proof of the previous proposition. 

Lemma 12. Let T be a relational structure whose age is contained in the age of an to- 
categorical structure A. Then T embeds into A. □ 

The following lemma is a central step in our arguments. 

Lemma 13. Let T be an to -categorical r-structure that contains all existential positive 
definable relations. Then every homomorphism between two structures T± and of the 
same age as c(T) is an embedding. 

Proof. Since T is w-categorcial and because the age of Ti and T2 is contained in the age of T, 
Lemma [T2l implies that both T% and T2 embed into T. For simplicity of notation, we assume 
that Ti and T2 are substructures of T. Now suppose for contradiction that / : T\ — > T2 
is a homomorphism that is not an embedding, this is, for some tuple u = (ui, . . . , u^) of 
elements and some fc-ary relation R in Ti the mapping / does not preserve the formula 
—>R(u) or does not preserve the formula u\ 7^ U2- We will then construct an endomorphism 
h of T such that h(T) does not contain a copy of the substructure S induced by u in T±. 
This is a contradiction: On the one hand S is a substructure of c(T), since Ti has the same 
theory and thus the same age as c(T). On the other hand, since S is not a substructure of 
h(F), Proposition [TT1 savs that S is not a substructure of c(T). 

To construct this homomorphism h we consider an infinite but finitely branching tree. 
The vertices on level n in this tree will be labeled by equivalence classes of good homomor- 
phisms from {a\,a2, • • • , a n } to T, where oj, 02, • • • is an enumeration of T. A homomor- 
phism g on level n is good, if the structure induced by g{{a\, . . . , a n }) does not contain an 
induced copy of S. Two homomorphisms gi and 52 ar e equivalent if there exists an auto- 
morphism a of r such that g\ = g2Ct. Adjacency is defined by restriction; this is, two nodes 
on level n and n+ 1 are adjacent in the tree if there are representatives g\ and 52 from these 
nodes such that g\ is a restriction of g2- Clearly, all restrictions of a good homomorphism 
are again good homomorphisms, and all homomorphisms in an equivalence class are good, 
or all are not good. By w-categoricity of T, the tree has only a finite number of vertices 
on level n, and in particular it follows that the tree is finitely branching. The crucial step 
is that the tree contains vertices on every level, i.e., there exists a good homomorphism 
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h n : {ai, . . . , a n } — > T for each n > 1. We show this in the following; and here we use the 
assumption that T contains all existential positive definable relations. 

To find h n for each n > 1, we consider a sequence (h n )i>o of homomorphisms to T, 
where the domain of h n is {ai, . . . , a n }, and the domain of equals the image of h n . 
Hence, we can define the following composed homomorphism h$ : {a\, . . . ,a n } — > V by 

hn\x) := h l n {. . . h\(h n (x)) • • • )• We now define the sequence (h n )i>o- The homomorphism 
h n is the identity. For i > 0, if the structure induced by the domain of h l n does not contain 

an induced copy of S, we are done, because then hn ^ is a good homomorphism from 
{a\,...,a n } to r. Otherwise, there are elements (b\,...,bV) in the domain of h l n that 
induce in T a structure isomorphic to S. We now define h l n (b 1 -) := f(uj) for 1 < j < k, 
and want to extend this mapping to a (strict) homomorphism h l n on the other elements 
b l k+1 , . . . , b l m , m < n, in the domain of h n . Consider the formula (ft := 3x l k+1 , . . . , x l m ip with 
free variables . . . , xt, where ip is a conjunction of atomic formulas defined as follows. We 
use the structure induced by b\, . . . , b l m in T to define ift. For all ji, - ■ ■ ,ji <m, the formula 
ip contains a conjunct Rix 1 ^ , . . . , x 1 ^ ) iff ii holds on 6*. in T. It is then clear that 

the formula holds for 6^ , . . . , b\. 

Since V contains all existential positive definable relations, the existential positive for- 
mula (j) also holds on u\, . . . since these vertices induce the same structure as b\, . . . , b l k . 
Since / preserves existential positive formulas, (p also holds on f(ui), . . . , f(uk) in T. We 
thus can find witnesses rjt+i, . . . , r m in T for the variables £n| , l3 . . . , xj„ of the existential 
quantifiers in (ft, and extend hi by h l n (b l j) := rj for fc + 1 < j < m. Then h\ clearly is a ho- 
momorphism to r, which is also strict, because it does not preserve some of the inequalities 
or negated relations that hold on b\, . . . ,b\. Therefore the sequence (hn )i of homomor- 
phisms must be finite, because w-categoricity of T implies that there are only finitely many 
non-isomorphic homomorphic images of the structure induced by {a\,... ,a n } in T. Let 

hn°^ be the last homomorphism in this sequence. By construction, this mapping is a good 
homomorphism h n for every n > k. 

Therefore, the constructed tree contains vertices on all levels, and Konig's tree lemma 
asserts that the tree contains an infinite path. Since adjacency is defined by restriction, this 
path defines an infinite endomorphism h of T. The image h(T) does not contain an induced 
copy of S. This contradicts the minimality property of c(T) formulated in Proposition [TT1 
Hence, every homomorphism from T± to T2 is an embedding. □ 

Corollary 14. Every u>- categorical T-structure T has a core. 

Proof. Let A be the expansion of T by all existential positive definable relations, and let 
c(A) be the structure induced by the endomorphism c constructed in Proposition QT] for 
A. Lemma LT3l shows in particular that endomorphisms of c(A) are cores. Let To be the 
r-reduct of c(A). Clearly, since c(A) is a core, To must be a core as well. Because c is also 
an endomorphism of T, we have that Tq is a core of T. □ 

We already mentioned that the core of a finite structure is unique up to isomorphism, 
As we will see now, an w-categorical structure might have non-isomorphic cores. 

Example. Consider the following u-categorical structure T. Let Q be the set of rational 
numbers, and let X be a disjoint countably infinite set. The domain of T is Q U X, and the 
signature contains two binary relation symbols < and 7^, and one unary relation symbol P. 
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For two elements x,y in T the relation x < y holds iff x, y E Q and x is a strictly smaller 
number than y. The relation x ^ y holds iff x and y are distinct elements in T. Finally, 
P(x) holds iff x > holds. It is easy to verify that T is w-categorical. One way to see this 
is by Proposition HI because the expansion of T by the unary relation N := {x € Q | x < 0} 
is homogeneous. 

The structure T has a model-complete core, which is unique up to isomorphism, namely 
the structure A := (Q + ,<,^,P), where Q + is the set of all positive rational numbers, 
where the interpretation of < and ^ is as before, and where P denotes the trivial unary 
relation that contains all elements of the domain. The structure A is indeed the image of an 
endomorphism c of T, for instance the endomorphism that maps Q in an order-preserving 
way to the set {x € Q | x > 3}, and maps X bijectively onto {x € Q | < x < 3} 
in an arbitrary way. Let A be the structure that is induced by the image Q + of this 
endomorphism. It is easy to verify that A is a core, and hence A is a core of T. It will 
follow from Theorem [16] below that all other model-complete cores of T are isomorphic to 
A. 

However, we claim that (V, <,^,P) has other cores that are not isomorphic to A, and 
not model-complete. For instance, let V be the substructure of T induced by Q + \ (1,2), 
i.e., the set of positive rational numbers without the open interval (1,2). Then T' is the 
image of an endomorphism d constructed similarly as above, with the only difference that 
c' maps X bijectively onto {xE<Q)|0<x<lor2<x<3}. It can be verified easily that 
the structure V is a core. To see that V is not model-complete, consider the restriction d 
of the mapping c' to the elements of V. Since V is a core, the mapping d is an embedding 
of r' into r'. However, it is not an elementary embedding, since the formula 3z. x < z < y 
holds for x = d(l) and y = d{2) in T', but does not hold for x = d(l) and y = d(2) in the 
structure induced by d(T'). 

We will show that every u;-categorical structure T is always homomorphically equivalent 
to a model- complete core, which is unique up to isomorphism. For that, we first prove a 
stronger result for structures that are expanded by all existential positive definable relations. 

Proposition 15. Let T be an u- categorical t- structure. IfT contains all existential positive 
definable relations, then T is homomorphically equivalent to a homogeneous core T c , which 
is unique up to isomorphism. Moreover, T c is finite or u- categorical. 

Proof. Let T be the set of all universal sentences that are true in the structure c(T) con- 
structed in Proposition [TTJ Proposition [7] shows that there is an extension A of c(T) that 
is an existentially complete model of T. Since A satisfies the same universal sentences as 
c(r), the two structures have the same age. Because T is w-categorical, and because the 
age of A is contained in the age of V, we can apply Lemma [T2l to show that A embeds into 
r. Assume for simplicity of notation that A is a substructure of T. 

Let (j) be an existential formula, and let t be a tuple from A. We claim that <p holds 
on t in r if and only if cj) holds on t in A. One direction is clear, because A embeds into 
r. Now, consider the restriction d of c to A. Because d(A) is a substructure of c(T), and 
c(r) is a substructure of A, d is a endomorphism of A. Because A and c(T) have the same 
age, Lemma [13] implies that d is an embedding. Hence, the structure induced by d(A) is 
isomorphic to A. The embedding d preserves the existential formula (ft, and therefore (ft 
holds on d(t) in A. Because A and thus also d(A) are existentially complete models of 
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T, the formula 4> also holds on d(t) in the structure d(A) (not only in A). Since d is an 
embedding, we then know that <fi holds on t in A. 

Therefore, because the structure T is expanded by all existential positive relations, the 
structure A is also expanded by all existential positive relations. All models of Th(A) must 
have the same age as A and c(T), and hence Lemma [13] shows that every homomorphism 
between two models of Th(A) is an embedding. By Corollary [10] there is a homogeneous 
structure T c with the same age as A and c(T). Theorem [5] shows that this structure is 
unique up to isomorphism. The structure T c might be finite. If T c is infinite, and since T c 
contains only finitely many /c-ary relations for each k > 1, Proposition H] shows that T c is 
w-categorical. 

To conclude the proof, we have to show that T c is homomorphically equivalent to T. But 
this is clear since T homomorphically maps to c(T), which embeds into T c by Lemma [T2l 
Lemma [12] also shows that T c embeds into T, because it has a smaller age than T and 
because T is w-categorical. □ 

We can now prove one of the main results. 

Theorem 16. Every to -categorical r-structure T is homomorphically equivalent to a model- 
complete core T c , which is unique up to isomorphism. The core T c is uj -categorical or finite. 
All orbits of ' k -tuples in T c are pp-definable in T c . 

Proof. Let A be the expansion of T by all existential positive definable relations. By the 
previous Proposition, A has a homogeneous core A c , which is finite or w-categorical. By 
Proposition H] the structure A c has quantifier elimination, and therefore every first-order 
formula is equivalent to a quantifier-free formula (in the expanded signature of A and A c ). 
Moreover, A c is w-categorical. Let T c be the r-reduct of A c . It is well-known (and follows 
from Theorem [3] see [25]) that T c is w-categorical as well. 

We want to show that T c is a model-complete core. Suppose that e is an endomorphism 
of r c . We have to show that e is an elementary embedding. Let <fi be a first-order formula. 
Then eft is in A c equivalent to a quantifier- free formula tp. Note that e preserves existential 
positive formulas, and because A c is a core, e preserves tp as well. Therefore, e is elementary. 

Now suppose that V is some model-complete core of T. We show that V is isomorphic 
to r c . In fact, we will show that A c and the expansion A' of V' by all existential positive 
definable relations have the same age and are both homogeneous. Then Theorem [5] implies 
that A c and A', and hence also T c and V are isomorphic. 

Similarly as in the proof of Proposition [15] we prove that every existential formula that 
holds on tuples from A' in A also hold in A'. This also shows that A' is w-categorical. To 
show that A' is homogeneous, by Proposition 0] it suffices to show that A' has quantifier 
elimination. Let <fi be a first-order formula. Since V is model-complete, Proposition [6] shows 
that 4> is i n r" and therefore also in A' equivalent to an existential formula. Since A' is 
o;-categorical and a core, we can then apply Lemma [8] to show that eft is equivalent to an 
existential positive formula in A'. Since A' contains all existential positive formulas, we 
have shown that A' admits quantifier elimination. 

Finally, let R be an orbit R of fc-tuples in the automorphism group of T c . All fc-tuples in 
R induce isomorphic substructures S in A c . Because A c is homogeneous, all /c-tuples in A c 
that are isomorphic to S are contained in R. Thus, R has a definition as a conjunction <p of 
atomic formulas. We replace all relation symbols in ip that are contained in the signature of 
A c but not in the signature of r c by their existential positive definition. Then the resulting 
formula can be re-written as a disjunction of primitive positive formulas. Every disjunct is 
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either false or already defines R, because R is an orbit of fc-tuples in Aut(T c ). Hence, the 
formula is equivalent to a single disjunct, and R is pp-definable in T c . □ 

Corollary 17. If T is an lo- categorical structure that is expanded by all pp-definable re- 
lations. Then T is homomorphically equivalent to a homogeneous core T c (which is again 
unique up to isomorphism). 

Proof. Let T c be the w-categorical model-complete core that is homomorphically equivalent 
to r. Theorem 1161 shows that every orbit of fc-tuples in the automorphism group of T c has 
a pp-definition in T c , and the proof of Theorem 1161 shows that because T is expanded by all 
pp-definable relations, T c contains all pp-definable relations as well. We have also seen that 
in r c , every first-order definable relation has a quantifier-free definition in the expanded 
structure that contains all existential positive formulas. Every existential positive formula 
is a disjunction of primitive positive formulas, and therefore the structure T c has quantifier- 
elimination as well. Proposition SI implies that T c is homogeneous. □ 

We also obtain alternative characterizations of when a model-complete w-categorical 
structure is a core. A set of functions F from T to T locally generates a function g, if for 
every finite subset A of T there is a function f & F such that g(a) = f(a) for all a € A (this 
is a standard notion in universal algebra [33]). 

Theorem 18. Let T be a model- complete u- categorical structure. Then the following are 
equivalent. 

(1) T is a core. 

(2) Every first- order formula is in T equivalent to an existential positive formula. 

(3) Every endomorphism ofT is an elementary embedding. 

(4) The automorphism group ofT locally generates the endomorphism monoid ofT. 

Proof. Every first-order definable /c-ary relation R in T is the union of a finite number of 
orbits of /c-tuples of Aut(T). Theorem 1161 shows that each of these orbits has a pp-definition. 
A disjunction of primitive positive formulas can be equivalently written as an existential 
positive formula, and we obtain an existential positive definition for R, showing that 1 
implies 2. 

Since endomorphisms clearly preserve existential positive formulas, 2 implies that every 
endomorphism is elementary. 

A function from T to T is in the local closure of the automorphism group of T if and 
only if it preserves all first-order definable relations. If we assume that every endomorphism 
is elementary, then the automorphism group locally generates all endomorphisms of T, and 
therefore 3 implies 4. 

To prove that 4 implies 1, assume that T has a strict endomorphism /, i.e., there is a tu- 
ple (ui, . . . ,Uk) in T such that f{m) = f(u 2 ), or R(f(m), . . . ,f(u k )) and not R(ui, . . . ,u k ). 
Clearly, such a function / cannot be locally generated by automorphisms. □ 

7. Adding Constants to the Signature 

In relational structures T we can use singleton relations, i.e., relations of the form 
{c} for an element c from the domain of T, to model the concept of constants in first- 
order structures [25]. One of the main results in [12] says that if T is a finite core, then 
adding a singleton-relation does not increase the complexity of the corresponding constraint 
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satisfaction problem. We show that the same holds for constraint satisfaction problems 
where the template is an w-categorical model-complete core. Note that this directly applies 
to all the computational problems presented in the introduction and Sections [3] and HI 

Theorem 19. Let T be a model- complete u- categorical core, and let V be the expansion of 
r by a unary singleton relation C = {c}. // CSP(T) is tractable, then so is CSP(T'). (If 
CSP(T') is NP-hard, then so is CSP{T).) 

Proof. We show how to solve CSP(r') in polynomial time, under the assumption that 
CSP(r) can be solved in polynomial time. Let S' be an instance of CSP(r'). Let P be 
the orbit of c in the automorphism group of V. By Theorem [TBI P is primitive positive 
definable in V. Thus we can assume without loss of generality that V and V contain the 
relation P. Replace all occurrences of the relation C in S' by the relation P. Solve the 
resulting instance S of CSP(T); by assumption this is possible in polynomial time. If S is 
not satisfiable, then in particular S' was not satisfiable. On the other hand, if there is a 
homomorphism h from S to T, we claim that there is a homomorphism from S' to V. Since 
P is the orbit of the element c, there is an automorphism a of T such that ha is a solution 



If an w-categorical model-complete core T is expanded by a singleton relation, the 
resulting structure V is again an o;-categorical model-complete core. The fact that u- 
categoricity is preserved by such expansions is well-known, see e.g. [25]. Suppose that I~" is 
not a model-complete core, i.e., there is a non-elementary endomorphism e of T' . Then e is 
also a non-elementary endomorphism of T, a contradiction. Hence, we can apply Theorem[T9l 
several times, and obtain that Theorem [19] also holds for expansions by a finite number of 
singleton relations. 
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